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We consider a class of random, weighted networks, obtained through a redefinition of patterns in an 
Hopfleld-like model and, by performing percolation processes, we get information about topology and 
resilience properties of the networks themselves. Given the weighted nature of the graphs, different 
kinds of bond percolation can be studied: stochastic (deleting links randomly) and deterministic 
(deleting links based on rank weights), each mimicking a different physical process. The evolution 
of the network is accordingly different, as evidenced by the behavior of the largest component size 
and of the distribution of cluster sizes. In particular, we can derive that weak ties are crucial in 
order to maintain the graph connected and that, when they are the most prone to failure, the 
giant component typically shrinks without abruptly breaking apart; these results have been recently 
evidenced in several kinds of social networks. 
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I. INTRODUCTION 

Network theory is a fundamental tool for the modern 
understanding of complex systems: by a simple graph 
representation, where the elementary units of a sys- 
tem become nodes and their mutual interactions become 
links, a lot of properties about the structure and dynam- 
ics of the system itself can be inferred [1 . 

Recently, the characterization of network dynamics has 
become a central issue: networks are intrinsically dy- 
namic and continuously accommodate novel members, 
lose their original elements, as well as build, erase and 
rearrange their links [2 j. The structural reorganization 
of networks may arise due, e.g., to a change in the re- 
sources providing the energy to maintain their links, or 
to a large stress [3]|4]. In this context, percolation [5j|6] 
constitutes a very interesting process able to mimic a fail- 
ure or a damage of links/nodes. Moreover, percolation 
represents one of the simplest example of dynamical pro- 
cess on a graph, exhibiting a phase transitions [7 and, 
indeed, it has been mapped into several other critical 
phenomena; as well, applications in epidemiology, traf- 
fic models and in the analysis of technological networks 
resilience have been deeply studied [5UTT]. 

Here we apply percolation processes as a means in or- 
der to probe the topology and the resilience of a net- 
work itself. We especially focus on a class of stochas- 
tic, weighted networks Q recently introduced in [12J[T3]. 
Such networks are generated by assigning to each node a 
set of attributes and by linking two nodes whenever the 
pertaining attributes are similar enough; the larger the 
similarity, the stronger the link. As shown in [l2j[T3], the 
resulting class of (weighted) networks Q exhibits interest- 
ing properties such as imitative interactions (by construc- 
tion), degree-degree correlation, high transitivity (i.e. a 



large clustering coefficient) and a properly tunable topol- 
ogy through a parameter 0, which controls the distribu- 
tion of attributes. Therefore, such networks constitute 
an efficient tool to describe several different systems that 
belong to disparate contexts, ranging from biological net- 
works p~4j [15] , to technological structures [16] and to so- 
cial organizations [T7HT9] . 



Now, since the graphs under investigation are 
weighted, we can perform different kinds of percola- 
tion processes: random (where links are deleted in a 
purely random fashion), and deterministic (where links 
are deleted in rank order from the weakest to strongest, or 
vice versa). We especially focus on graphs Q obtained for 
6 = and 6 = 0.25, corresponding (in the limit of large 
size) to fully connected weighted networks and account- 
ing for an "unbiased" and "biased" pattern distribution, 
respectively. First of all, we consider the relative size of 
the largest connected component S as a function of the 
fraction / of links left: numerical data suggest that a "gi- 
ant component" emerges when the fraction / approaches 
a "critical" value / c , which is found to scale with the 
system size according to V~ v where v depends on the 
kind of dilution. The latter also controls the sharpness 
of the percolation, as well as the distribution of cluster 
sizes, showing that, for biased pattern distributions, weak 
ties play a crucial role as they can be used to build up 
a spanning tree, conversely, strongest links are typically 
redundant, as a result, if weak ties are the most prone to 
failure the system will exhibit a poor resilience. 

The paper is organized as follows: in Sec.[ll]we describe 
the correlated random networks we are focusing on, as 
well as the percolation processes we perform. Then, in 



Sec. [El] we present the basic probability relations con- 
cerning the coupling distribution, from which we can in- 
fer qualitative information on the properties of the per- 
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eolation transition; these properties are confirmed by the 
results of the numerical analysis that is reported in the 
following sections. In particular, the behavior of the gi- 
ant component is studied in Sec. |IV[ the distribution of 
cluster sizes is described in Sec. [Vl and the behavior of the 



clustering coefficient is examined in Sec. VI An overall 
discussion on the results and on the perspectives of our 



work is contained in Sec. VII In Appendix A we show 
that the class of graphs that we consider displays dissor- 
tative mixing in a wide region of the parameter set, while 
in Appendix B we present some analytic results which are 
valid for the random percolation process on Q. 



II. THE MODEL 

We first introduce the class of networks on which we 
focus our analysis and later we describe the percolative 
processes we will perform on such networks. 



A. Network generation 

Recently, a new approach to generate correlated ran- 
dom networks has been introduced [12j [13] ; the approach 
is based on a simple shift [—1, +1] — >> [0, +1] in the defi- 
nition of patterns in an Hopfield-like model and it allows 
to generate a broad variety of different topologies rang- 
ing from fully-connected to small-world, to extremely di- 
luted. 

More precisely, we consider a set of V nodes, each 
endowed with a set of L attributes encoded by a bi- 
nary string £; the ensemble of strings is extracted ac- 
cording to the probabilities P(£ M = 0) = (1 — a)/2 and 
P(£ M = 1) = (1 + a)/2, where the fixed parameter a be- 
longs to the interval [—1,1]. Then, the coupling between 
two generic nodes i and j is given by the rule 



(i) 



Therefore, the wider the overlap between non-null en- 
tries and the larger the weight associated to the link, 
with Jij G [0,L]; the extreme case Jij = means that 
there exists no link between nodes i and j. The values 
taken by the strings components admit the following in- 
terpretation: = 1 means that agent i is endowed with 
the particular feature /i, this feature can represent a bi- 
ological trait or an individual attitude according to the 
considered system (the absence of this particular feature 
corresponds to £f = 0). Then Eq. [I] states that agents 
show homophily. For example, in social networks, people 
interact with others of similar age, income, race, etc. 

As shown in [12j [13] , the way a node is connected to 
the network is sensitively affected by the number p of 
non-null entries present in the pertaining string, that is, 
for the z-th node, pi = (notice that since p is 

Poissonian, its average is given by p = L(l + a)/2). In 



fact, one finds that the average probability P\i n k(Pu a) 
that i is connected to another generic node, reads as 



I- a 



Moreover, by averaging over all possible string arrange- 
ments, one finds for the average link probability p be- 
tween two generic nodes 



p=l 



1 -ha 



The class of networks that are generated in this way 
exhibit different levels of correlation. For instance, it is 
easy to see [12j [13] that two neighbors of a given node 
are more likely to be connected than they would be if the 
graph was purely random generated; this kind of transi- 
tivity also affects the weights associated with the links 
[I2j [13]. Such networks also display a dissortative be- 
havior. Indeed, the nodes having strings with small p 
typically possess a small coordination number and they 
are more likely to be linked with nodes with large p. The 
mathematical aspects of the degree correlations are elab- 
orated in Appendix A. 

Finally, we introduce the parameter a = L/V which 
turns out to crucially control not only the topology but 
also the thermodynamic of the system [12] [13] . Here we 
assume a to be constant and finite, which means that, 
as the volume of the system grows, the length of the 
string increases proportionally; this corresponds the the 
so-called high-storage regime in neural networks [20] . In- 
terestingly, as V —> oo there exists a vanishingly small 
range of values for a giving rise to a non-trivial graph; 
such a range can be recognized by the following scaling 



-1 



_7_ 



(2) 



where > and 7 is a finite parameter. As explained in 
[T3] , controls the connectivity regime of the network- 
ranging from fully connected (FC, < < 1/2) to ex- 
tremely diluted (1/2 < < 1) to completely disconnected 
(0 > 1), while 7 allows a fine tuning. In particular, here 
we focus on < 1/2 and 7 < 2, corresponding to a FC 
regime: in this case topological disorder is lost, while dis- 
order on couplings is still present; however, notice that 
for = and 7 = 2, the coupling distribution gets peaked 
at J = L and disorder on couplings is relaxed as well. 

In the following, we will refer to the weighted random 
graph, generated as explained above, as (/(a, 0, 7, L), 
hence highlighting the dependence on the set of param- 
eters which control its size and its topology. We also 
anticipate that we will focus only on the cases = and 
= 0.25 corresponding to weighted, complete graphs. Of 
course, for these cases there is no topological correlation 
among links (the clustering coefficient is equal to 1 and 
assortativity is neutral), though correlation among link 
couplings is retained. 
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B. Percolation processes 

Given an arbitrary graph, bond percolation consists in 
deleting the existing links with some probability 1 — / 
or, in other terms, in occupying links with probability 
/; nodes connected together form clusters. When / ex- 
ceeds a given system-dependent threshold (or critical) 
value / c , a macroscopic cluster, i.e. a cluster occupying a 
finite fraction of all available sites, also called giant com- 
ponent, is formed. For various network architectures and 
space dimensionalities this transition is typically contin- 
uous, or second-order, as the system properties changes 
continuously at the critical point [H[2T]. For instance, on 
random networks a la Erdos-Renyi (ER) [22], one starts 
from a set of V nodes and adds links such that the prob- 
ability / that two nodes are joined by a link is the same 
for all pairs of nodes. When / < 1/V, the largest com- 
ponent remains miniscule, its number of vertices scaling 
as logV; in contrast, if / > 1/V, there is a component 
of size linear in V. Thus, the fraction of vertices in the 
largest component undergoes a continuous phase transi- 
tion at / = 1/V. 

As explained before, in the graph under study 
quenched weights are assigned to the edges and this al- 
lows to think of different kinds of processes, each cor- 
responding to different physical situations: The deletion 
of a link may mimic the failure of the link itself due to 
overload [4 or, rather, to error or attack which may af- 
fect randomly any link [25] . In the former case links 
with higher weight are the first to be deleted, while in 
the latter case deletion occurs randomly. In other kinds 
of situations we can think that nodes transfer a signal to 
neighbors and the passage of information is effective only 
when the tie strength is larger than some noise level [24 . 
Therefore, as the level of noise grows, more and more 
links starting from the weakest ones, get ineffective. To 
summarize, we deal with the following processes: 

• Random percolation (RP): starting from the origi- 
nal graph Q{ol, 9, 7, L) we consider each link and we 
remove it with probability 1 — /, independently of 
the couple of adjacent nodes, in such a way that / 
is the fraction of links left; as / is tuned from to 
1 we range from a completely disconnected graph 
to the original graph. 

• Deterministic- Weak percolation (WP): starting 
from Q(a,6,^,L), we remove all links with weight 
smaller than a given threshold l\ that is to say, as 1 
is tuned from to L, we remove links in rank order 
from the weakest to strongest ties. 

• Deterministic- Strong percolation (SP): starting 
from Q(a,Q,^,L), we remove all links with weight 
larger than a given threshold i\ analogously to the 
previous case, this corresponds to remove links in 
rank order from the strongest to the weakest ties. 

In order to evaluate the impact of removing ties, we 
measure the relative size of the largest connected compo- 



nent S, providing the fraction of nodes that can all reach 
each other through connected paths, as a function of the 
fraction / of links left /. We also measure the average 
squared size S = Yl^=i n s s<2 /V, where n s is the number 
of clusters containing s nodes. According to percolation 
theory, if the (infinite) network collapses because of a 
phase transition at f c , then S diverges as / approaches 

f- urn- 



III. COUPLING DISTRIBUTION 

The coupling distribution P coup \(J; a, L) plays an im- 
portant role as for deterministic processes, so that it is 
worth recalling some previous results [12] and deepening 
its dependence on the system parameters. 

The probability for two strings & and £j (with pi and pj 
non-null entries) to be connected by a link with weight J 
is just the probability that the strings display J effective 
matchings; this has been found to be [T2] 



-Pmatch (J 5 Pi 5 Pj 1 L) ~ 



(j) (pi-j) (pj-j) 
\PiJ \pjj 



(3) 



from which we can write that, in the average, the cou- 
pling distribution reads off as 

L L 

-^match 

Pt =o Pj =0 

x P 1 (p i ;a,L)P 1 (p j ;a,L), (4) 

being P\{p; a, L) = Q [(1+a) /2}p[(1-cl) /2} l ~p the prob- 
ability that a given string displays p non-null entries. 
Therefore, we get 



Pcou P i(J>,L) = (^)(a + l)- 2L 



(5) 



where we called a = (l + a)/(l — a). Since we are focusing 
on the case L = aV, with a string bias a given by Eq. [2] 
it is convenient to rewrite the coupling distribution as a 
function of the effective parameters, namely 



-^coupl 
L L 



EE 

pi=0 pj=0 



L — J 

Pi ~ J 



L 

-Pi 



7 



2L 



pj- J) [2{L/a) 



2(L/a) e _ 

7 



(6) 



' 7 



pi+pj 



The previous expression shows that for systems large 
enough, and a and L fixed, the distribution gets peaked 
at smaller J as is increased (when > 0.5 only cou- 
plings with value or 1 display non vanishing proba- 
bility) and the same holds for fluctuations. Moreover, 
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FIG. 1: (Color on line) Coupling distributions 

P C oup\(J, a, 7, L) from different values of L (depicted 
in different colors, as shown by the legend) and for 7 = 1, 
a = 0.1, 6 = (left panel) or = 0.25 (right panel). Curves 
represent Eq. [6] 




FIG. 2: (Color on line) Log-log scale plot of the aver- 
age coupling J(a,#,7,L) (main figure) and its fluctuations 
AJ(a,0,7,L) (inset), for = (O) and = 0.25 (□), as 
shown in the legend. Symbols represent numerical data, while 
curves represent analytical estimates from Eq. [8] and Eq. [9] 
respectively. 



a link is absent with probability P coup i(0; a, 7, L) = 
[1 - 7 2 /4(a/L) 2(9 ] L , which decays to zero for 6 < 0.5. 

In particular, in the following analysis we assume 7 = 
1, a — 0.1 and = or — 0.25; for = we can write 
explicitly 



Pcou P i(J;0.1,0,l,L) = ^2- 2L 

L L 1 

EE 

Pi=0 pj=0 



( Pi - J)\{pj~ J)\{L- Pi - Pj + J)V 



(7) 



and similarly for the latter. In Fig.[T]we show a compari- 
son of the two cases where numerical data are fitted with 
curves given by Eq. [6] Data corroborate that, even at 
relatively small sizes, the analytical formula above pro- 
vide a good approximation and that the distribution gets 
broader for larger L and smaller 6. More precisely, we 
calculate the average coupling J(a, 0, 7, L) and its fluc- 
tuations AJ((a, 0, 7, L)) as 



J(a,0,7,L) = ^2 J p covLpi(J',ot,0,'y,L) 



7 2 L 



,(8) 



j=o 



4 (L/a) 2e 

A J(a, 0, 7, L) = £)(J - J) 2 P C oupi( J; a, 0, 7, L),(9) 



j=o 



where for the closed form expression in Eq. [8] we used 
Eq.[6] Relevant results are shown in Fig. [2] where, again, 
the comparison between analytical estimates and numer- 
ical data is successful. 

Interestingly, from the width of the distribution one 
can infer information about the sharpness of the deter- 
ministic percolation: a broader distribution is expected 



to give rise to a less sharp transition. Moreover, we no- 
tice that the case 7 = 1 and = corresponds to a = 0, 
namely it corresponds to an unbiased distribution for 
strings, and this yields to a rather symmetric coupling 
distribution: as a consequence, SP and WP are expected 
to behave similarly. Conversely, when the coupling dis- 
tribution is not symmetric, as for = 0.25, different be- 
haviors emerge. All these points are deepened in the next 
section. 

Finally, we notice that for = 0.25 relatively small 
sizes give rise to non-fully-connected structures, that is, 
the coupling probability is non-null for J = 0. As we 
derived from Eq. [6j the probability that a link is absent 
decreases slowly with the size and such finite-size effect 
gets negligible only for V ~ 10 5 . Indeed, we find that 
finite-size effects enhance the skewness positivity of the 
distribution. 



IV. PERCOLATION TRANSITIONS 

In the following analysis we generate the graph 
G(ol, 6, 7, L) and, while performing a dilution process (ei- 
ther deterministic or random), we measure the number 
of clusters and their size; such results are then averaged 
over 10 2 realizations of G(a,6,~f,L) in order to account 
for the stochasticity of the graph itself. As explained 
in the previous section, in the thermodynamic limit both 
= and = 0.25 give rise to fully connected structures, 
so that, for large enough sizes, the random percolation 
process recovers the well-known results holding for ER 
graphs [26] . 

In order to evaluate the impact of removing ties, we 
measure the relative size of the largest connected compo- 
nent Sasa function of the fraction of links left /. Results 
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obtained for 6 = and 6 = 0.25 are shown in Fig. [3] and 
Fig. [4j respectively. 

Let us comment results of Fig. [3] First of all, we notice 
that when weak links are deleted first the graph starts to 
be disconnected (S < 1) at a value of / rather large, that 
is, weak ties are crucial to maintain the overall connec- 
tion of the graph and, in this sense, they work as bridges. 
Moreover, the WP transition is smoother than the one 
obtained from a random deletion of edges. This suggests 
that the deletion of weak ties yields the disconnection, 
from the giant component, of single nodes (indeed those 
displaying small p) or of small clusters. Otherwise stated, 
as / is increased from to 1, we first connect nodes dis- 
playing large overlap, hence forming a strong main com- 
ponent, while nodes with small p are likely to remain 
isolated or to form small clusters, which are successively 
annexed to the giant component: the process is therefore 
quite gentle. On the other hand, when links are intro- 
duced randomly, clusters grow up in a more uniform way, 
so that links merging disjoint components can give rise to 
a faster increase in the size of the giant component. As 
for the SP process, when = and 7 = 1, strings are ho- 
mogeneously distributed (a = 0), so that, as mentioned 
above, no qualitative differences are expected between 
SP and WP; in particular, in this peculiar case (a = 0) 
strong links also turn out to be crucial in maintaing the 
graph connected: being / the largest fraction of links for 
which S < 1, we get f RP < f S p < fwp- 

Let us now consider results for 6 = 0.25 shown in 
Fig. [4j As for the RP, slight quantitative changes in 
S(f) with respect to the previous case are due to finite 
size effects, while deterministic processes (DP) are also 
affected by the positive skewness of the coupling distribu- 
tion. More precisely, strings now display only rare non- 
null entries so that small components (typically made up 
of very close or even identical strings) can arise during 
a WP (this explains the sharper transition); also, strong 
ties are rather unlikely and their deletion does not mod- 
ify the connection of the giant component so that we can 
derive that they are redundant, that is, they typically 
do not participate to the spanning tree. For the sizes 
considered here we now have fsp < Irp < fwp- 

As anticipated, the occurrence of a percolation transi- 
tion is envisaged by a singularity in the average squared 
size S at / c , due to a network collapse as / approaches 
f c . We get consistent estimates for f c by evaluating the 
value of / corresponding to a maximum in the derivative 
of S and to the singularity in S. Results for 6 = and 
= 0.25 are shown in the insets of Fig. [3] and Fig. |4j 
respectively; the sets of values for f c have also been fit- 
ted with power law functions. Of course, for the RP the 
expected exponent v = — 1 is recovered [5,, while for WP 
and SP when = we find comparable exponents ~ 0.5; 
for = 0.25 finite size effects prevent to get sound es- 
timates for although a different behavior of SP with 
respect to the other cases is apparent. In any case, the 
RP corresponds to smaller values of / c , meaning that 
a smaller fraction of links is necessary to carry a giant 




FIG. 3: (Color on line) Main figure: Relative size of the 
largest connected component S versus the fraction of links left 
/ for a system of size V = 5700 and = 0. Fluctuations on 
these data, obtained by averaging over several realizations of 
the structure, are approximately 4%. Inset: f c versus system 
size; symbols represent numerical data, while curves represent 
the best fit given by a power-law with exponents vrp ~ 1 
and vwp ~ vsp ~ 0.5. Different percolation processes are 
compared as shown by the legend. 



component, that is, a smaller degree of redundancy is 
retained. 

Finally, some analytical insights for the case of arbi- 
trary and random dilution are presented in the Ap- 
pendix B, where we show consistency with known results 
about non-correlated networks. 



V. CLUSTER DISTRIBUTIONS 

While previous results offer a global description of the 
network dynamics, in this section we focus on the evo- 
lution of the internal organization of clusters by measur- 
ing the distribution N(p, s), representing the number of 
nodes corresponding to a string with p non-null entries 
and belonging to a cluster of size s. 

In Fig. [5] we show three sets of snapshots of N(p, s) for 
the case 6 = 0; each row represents a different dilution 
process (from top to bottom: RP, WP, SP), while each 
column represents a different regime (from left to right: 
f < fc: f ~ fc: f > fc)- While for the random dilution 
intermediate and high dilution regimes (panels b and c) 
are characterized by the existence of several clusters of 
different sizes, for deterministic dilution (panel e, / and 
h, i) a node basically either belongs to the largest com- 
ponent or is isolated. Moreover, in the former case due 
to the homogeneity underlying the process, curves are all 
peaked at around p namely, the set of attributes char- 
acterizing a given node does not affect the cluster size 
the node belongs to. Conversely, for WP (SP), larger 
(smaller) values of s yields distributions N(p, s) peaked 
at larger values for p. In particular, when weak ties are 
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FIG. 4: (Color on line) Main figure: Relative size of the 
largest connected component S versus the fraction of links left 
/ for a system of size V = 5700 and = 0.25. Fluctuations 
on these data, obtained by averaging over several realizations 
of the structure, are approximately 4%. Different percolation 
processes are compared (the legend is the same as in Fig. |3j). 
The inset on the right highlights the case of SP, where a jump 
in S occurs around / ~ 0.1; notice that this jump results from 
the fact that / cannot be continuously tuned (see also Fig.[T]). 
Inset on the left: f c versus system size; symbols represent 
numerical data, while curves represent the best fit given by a 
power- law with, in particular, vrp psj 1. 



the most prone to failure, nodes displaying strings with 
large p are the most likely to belong to the giant compo- 
nent. 

Analogous results for the case = 0.25 are depicted 
in Fig. [6] Now, in the intermediate regime, also for the 
WP a few small clusters emerge, while the abrupt jump 
in S(f) evidenced for the SP is recovered here by the fact 
that s assumes only two values: either 1 or V. 

We conclude this section with a remark. By focusing 
only on the topology of the graphs G(ql, 7, L) / 5 p, for a 
fixed parameter set and fixed /, we can compare the level 
of organization of the graph resulting from a different 
percolative process P. This can be attained by means of 
entropy measures [27], which, given a particular ensem- 
ble, provide the normalized logarithm of the number of 
networks in that ensemble, hence estimating how effec- 
tive the features characterizing the ensemble are. Here, 
we can fix the parameter set (a, 0, 7, L) and / (this some- 
how fixes the "energy" of the system) and measure the 
entropy within a configuration approach, namely working 
out the degree sequences; this approach is also related to 
a hidden variable model [13j [27] , consistently with the 
assignation of attributes. For instance, for = 0, when 
the dilution is low (most links still present) the entropy 
of the ensemble WP is expected to be larger due to the 
presence of a few isolated nodes which yield a larger num- 
ber of configurations; vice versa, when the dilution is in- 
creased the entropy of the ensemble RP is expected to 
prevail. Further analysis on this point may lead to spec- 



ulate that a failure of a limited number of nodes is likely 
to involve only weak ties, while when the failure is wider 
it is more likely to involve any generic node. Similar rea- 
soning can be extended in order to account also from an 
energy contribution due to the coupling. 



VI. CLUSTERING AND CORRELATIONS 

In this section we want to focus the attention on the 
properties of clustering and of correlation among links as 
we dilute them. 

Before proceeding it is worth recalling that the cluster- 
ing coefficient C provides a measure of the transitivity of 
the graph and it can be calculated as the average over 
nodes i of the local clustering coefficient d, defined as 
the actual number of links between the vertices within 
the neighborhood of z, divided by the maximum number 
of links that can exist between them, that is 

C'ht^ht^bry do) 

2=1 2=1 U % } 

where E{ is the number of links among nodes which are 
connected to i (node i is not included), Zi is the number 
of neighbors (also called degree) of i and one convention- 
ally sets Ci = for zi = 0, 1. A graph is often referred to 
as small- world, if its diameter is small (scaling as log V, 
which is is verified by Q) and its average clustering coef- 
ficient is significantly higher than the one relevant to a 
random graph constructed on the same vertex set, mean- 
ing C = f. As evidenced in [I2j [13], the graph under 
study can be defined as small-world. 

As shown in Fig. [7] (upper panel), when = 0, the 
clustering coefficient relevant to the graph obtained with 
a random dilution just corresponds, as expected, to the 
coefficient pertaining to an analogous ER graph; con- 
versely, the clustering coefficients for deterministic dilu- 
tion are larger and display a less trivial profile. In fact, 
for the RP, starting from small / we first build up a set of 
uncorrelated small components having zero or very small 
coefficient, so that their contribution to C is negligible; a 
significant and regular increase in C is only set up from 
the percolation threshold (dashed line in the figures). 

As for the WP, starting from small / we first connect 
nodes having strings with large p and these form a highly 
clustered component which already contributes to C; as / 
is increased the largest component gradually expands and 
C consistently grows; when / > f c most nodes are con- 
nected and new links serve to connect low-degree nodes 
so that the rate of growth of C is reduced; finally, when 
the network is connected (no isolated nodes) all new links 
determine an improvement in the clustering so that there 
is an acceleration in the growth of C. 

Similar arguments apply for the SP percolation: for 
small / only very weak ties are introduced and these, 
due to the relatively homogeneity of the graph (a = 0), 



7 





3 175 



350 












<*> 


^^^\ 14232 


175 

















s=11938 j\ 




h) / J 





880 

P 




FIG. 5: (Color on line) Number of nodes N(p, s) corresponding to a string with p non-null entries and belonging to a cluster 
of size s, plotted as a function of p, while different values of s and shown in different colors. The system considered has size 
V = 17500, 9 = 0, a = 0.1 and 7 = 1. The nine panels are arranged in such a way that each row represents a different dilution 
process (from top to bottom: RP, WP, SP), while each column represents a different regime (from left to right: / < f Cl f « / c , 
/ > .AO- 



are sufficient to build up a structures component which 
progressively grows determining a larger and larger C. 

In Fig. [8] we show results for the case = 0.25; due to 
the finite-size effects affecting SP, we just focus on the 
cases RP and WP. Of course, for the RP no qualitative 
changes are evidenced with respect to the case = 0, 
while for the WP, as / is increased, detached small clus- 
ters are now more likely to occur due to the sparsity of 
non-null entries in strings and this explains the fact that 
C is now qualitatively comparable with the ER case. 

Another interesting coefficient which we introduce, in 
order to monitor the evolving topology as links are re- 
moved, is a slightly modified version of the clustering 
coefficient, which we denote as C and define as the av- 
erage of the local dilution Q, given by the fraction of 
links within the subgraph containing the node i and all 
its neighbors, that is 

l—l l—l v 7 

where E{ is the number of links connecting any couple of 



nodes belonging to the subgraph, including i itself, and 
one conventionally sets C = for zi =0. We remark that 
Ci differs from C{ by the fact that here we count all links 
within the neighborhood of z, including those stemming 
from z, i.e. Ei = Ei+Zi, from which C{ = Q + X^ 2/(^ + 
1). Hence, once / fixed, by comparing C and C, one 
can derive information about the arrangement of existing 
links: either highly clusterized, so to form a small-sized 
connected component with nodes having relatively large 
degree (comparable C and C) or highly scattered, so to 
eventually form a large connected component with nodes 
having relatively small degree (large C, small C). 

As shown in Fig. [7J again for = qualitative differ- 
ences emerge between deterministic and random dilution. 
For deterministic percolations C follows a behavior simi- 
lar to C: this results from the fact that, basically, we have 
only one component which keeps on growing as / is in- 
creased, the remaining nodes being mainly isolated. Vice 
versa, for random percolation we can distinguish three 
different regimes, demarcated by two extremal points: in 
the first regime we have the emergence of several small 
components (e.g. dimers, trimers), each with small but 
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FIG. 6: (Color on line) Number of nodes N(p, s) corresponding to a string with p no n- null entries and belonging to a cluster 
of size s, plotted as a function of p, while different values of s and shown in different colors. The system considered has size 
V = 17500, 6 — 0.25, a — 0.1 and 7 = 1. The arrangement of panels is the same as in the previous figure. 



non-null contributions to C; in the second regime such 
small components start to merge and this yields a reduc- 
tion in C ; finally, when the graph has reached a connected 
status, increasing the number of links can just produce 
an increase in C. 

For = 0.25, also the WP transition display the same 
multi-regime behavior, which confirms the picture above. 



VII. CONCLUSIONS AND PERSPECTIVES 

In this work we analyzed the evolution, under percola- 
tion process, of a class of weighted graphs Q introduced 
in [I2j [13], whose topological properties arise from imita- 
tive interaction among nodes and can be properly varied 
by tuning the parameters (a, 0, 7, L). In particular, here 
we fixed a = 0.1, 7 = 1 and = or = 0.25, while the 
size (V = L/a is the number of nodes) is varied; such a 
situation corresponds, for large enough volumes, to fully- 
connected graphs, still retaining a non-trivial distribution 

-^coupl {J 5 

a, 0, 7, L) for the coupling strength J associated 
to any link. This allows to perform and compare dif- 
ferent percolation processes: random (RP, where links 
are randomly extracted for deletion), deterministic- weak 



(WP, where links are deleted starting from the weak- 
est ones) and deterministic-strong (SP, where links are 
deleted starting from the strongest ones). 

Our results highlight that weak ties are the most cru- 
cial in order to ensure the overall connection of the sys- 
tem, that is, the size of the largest component starts to be 
smaller than V when only few (weak) links are deleted. 
When approaches value 0.5 from below (and in the 
presence of finite-size effects which affect the skewness of 
the coupling distribution) one can see that the spanning 
tree underlying Q is mostly made up of weak links, while 
strong linkes are unlikely and mainly redundant. Hence 
the robustness of Q sensitively depends on which ties are 
the most prone to failure. The fact that weak ties are 
fundamental to maintain the whole graph connected is 
consistent with the so-called theory of weak-ties [28j [29] , 
according to which, in social systems, weak ties work as 
bridges between different sub-communities. 

Moreover, we showed that removing in rank order, 
from the weakest to the strongest ties, shrinks the net- 
work, but does not precipitously break it apart, in such 
a way that the percolation is rather smooth. A similar 
phenomenon has been evidenced in the context of so- 
cial networks where, when all "declared friendships" are 
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FIG. 7: (Color on line) Clustering coefficients C (top panel) 
and C (bottom panel) as a function of / and for — 0, 
V = 5700; different percolative processes are compared as ex- 
plained by the legend. The vertical dashed and dotted lines 
highlight the percolation threshold for RP and WP, respec- 
tively. 



degree-degree correlation yields qualitative changes in 
the percolation behavior as expected from [30] . 

Analysis similar to those performed here can involve 
different connecting rules (see Eq. [I] [24 j) in order to 
figure out a possible relation between the kind of inter- 
action (e.g. imitative or ant i- imitative) and the dynamic 
behavior. Also, a possible mapping between the dilution 
obtained via cutting a fraction 1 — / of links and via a 
progressive reduction of the parameter a may be figured 
out. 
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FIG. 8: (Color on line) Clustering coefficients C (top panel) 
and C (bottom panel) as a function of / and for = 0.25, 
V = 5700; different percolative processes are compared as ex- 
plained by the legend. The vertical dashed and dotted lines 
highlight the percolation threshold for RP and WP, respec- 
tively. 



considered the graph is highly connected, but when only 
"strong" links are retained, selecting firstly "maintained" 
relationships and secondly "mutual" relationships, nodes 
get gradually disconnected forming only small subclus- 
ters [19]. Conversely, as we underlined, RP gives rise to 
more structures subclusters while diluting. 

A possible extension of this work could consider non- 
complete graphs (0 > 0.5) with random deletion of nodes 
so to evaluate whether also for such correlated networks, 



In this appendix we aim to show that overpercolated 
networks Q display negative assortativity by calculating 
how the average degree of nodes belonging to the neigh- 
borhood of i depends on the degree of i itself. In fact, we 
can write that in a graph G(a, 7, L) the probability for 
a node i to have neighbors which display in the average 
z neighbors is 

^deg-deg(^; Pii a j Li) = 
1 L 

77 ^2 p \mk(pi,Pj,L)P deg (z;pj,a, V)Pi(pj',a,L), 



Pj =i 



where M is the normalization factor, P\mk(pi, Pj,L) = 
1 — -Pmatch(0; Pi, Pj, L) is the probability that there ex- 
ists a link connecting i and j and P deg (z; Pj,a, V) = 
(^)[flink(Pj;a)Hl - Piink(pj;a)} y - Z is the probability 
that node j has z neighbors. Hence one finds that the 
average degree for z's neighbors is 

v 

z{pi\ a,L) = p deg-de g (>; Pu a, L)z = 

1 L 



Pj =i 



being z(pj;a, V) = V{1 — [(1— a)/2] pj } the average degree 
for node j. With some algebra one gets to 



z(pi;a,L) = 1 

1 ■ 2 



1 — a\ L /3 + a x L 



1 — a 



(Al) 
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Now, noticing that < (1 — a)/2 < 2/(3 + a) < 1, we can 
deduce that z(pi\a, L,V) is decreasing with p^, namely 
with z(pi]a,L,V), so that, as long as the mean-field ap- 
proach developed here is valid fl2j [13] , the graph displays 
dissortativity. 

APPENDIX B: ANALYTICAL RESULTS ON RP 



7 = 7(a/L) 6> /2, we can write 

[1_(1_ 7 2)L] L 

+ (l-27 2 +7 3 ) L l r ^z, (B4) 



The percolation problem has been studied over differ- 
ent kinds of structure, both analytically and numerically 
[31] [32] ; in particular, within the so-called configuration 
model approach [33] [34] , we can exploit the generating 
function formalism to get some insights into the problem. 
First of all, being Pd egree (k) the average degree distribu- 
tion for the generic graph Q (here we drop the dependence 
on the parameter set to lighten the notation), we define 

CO CO 

Go(x) = ^P degree (k)x\ G 1 (x) = ^2Q(k)x\ (Bl) 



k=0 



where Q(k) = {k + l)Pdegree(^ + 1)/^- Now, as- 
suming V large and the clustering not significantly, 
Q(k) is the so-called excess degree [34], represent- 
ing the degree distribution of the vertex at the end 
of a randomly chosen edge; notice that Gq(1) = z 
and G\(x) = G f (x)/z. Recalling that Pdegree(^) = 
£p=o(D [l-m P ] k m PiV - k) Pi( P ;a,L), (see 
P2] H3] ) , we can write 



G (x) = yP 1 (p;a,L) 



P =o 



x + (1 - x) 



I- a 



Moreover, for uniform link deletion probability, the mean 
cluster is [35] 



s=l + fG' (l) 



f 2 G' (l) 



(B2) 



which diverges when 1 — f G[(l) — 0; this point marks 
the percolation threshold of the system: for / > f c = 
1/G[(1) a giant component of connected vertices is es- 
tablished. Therefore, consistently with the Molloy-Reed 
criterion [36], when G[(l) < 1 the graph consists of many 
small components, while when C?i(l) > 1 a giant compo- 
nent can emerge. Here we find 



GUI) 



G'o'(l) 



[1 - h{a)]< 



[l-2fc(a)+0(a)], (B3) 



where h(a) = [(3 - 2a - a 2 )/4] L = p and g(a) = [(1 - 
a) (5 — a 2 )/8] L . Assuming a = — 1 + j(a/L) e and posing 




FIG. 9: (Color on line) Natural logarithm of G[ (1) for L = 20 
, 6 = 0.5 and different values of 7 and a as given by Eq. |B4| 
Whenever G ; (l) > 1 a giant component emerges. 



in analogy with the percolation threshold expected for 
the ER graph. In Fig. [9] we show G^l) as a function of 
7 and a and for a finite value of L. 

Finally, the formalism developed in the first part of 
this appendix has also been used to find an expression 
for the global clustering coefficient or transitivity of the 
configuration model [37] : 



37V, 



A 



V 



1 

Vz 



G' (l)Gi(l) 



(B5) 

where is the number of triangles in the network and 
N3 is the number of connected triples of vertices [34] . 
Notice that from Eq. |B5[ c is given by the coefficient 
expected for the ER graph, namely z/V, times an extra 
factor such that when the degree distribution is highly 
skewed, given that the factor z 2 /z 2 can be rather large, c 
is not necessarily negligible for the graph sizes relatively 
large. Interestingly, we find 



[1 - 2h(a) + g(a) 

rap 



212 



(B6) 



which, for a G [—1, 1] is always larger than p = 1 — ft (a), 
hence confirming the large degree of cliquishness of Q. 
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